Search for: All records

Creators/Authors contains: "Su, Yu"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full-text articles may not yet be available free of charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites, whose policies may differ from those of this site.

  1. Free, publicly-accessible full text available April 25, 2026
  2. We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as bird species. Pre-trained ViTs, such as DINO, have demonstrated remarkable capabilities in extracting localized, discriminative features. However, saliency maps like Grad-CAM often fail to identify these traits, producing blurred, coarse heatmaps that highlight entire objects instead. We propose a novel approach, Prompt Class Attention Map (Prompt-CAM), to address this limitation. Prompt-CAM learns class-specific prompts for a pre-trained ViT and uses the corresponding outputs for classification. To correctly classify an image, the true-class prompt must attend to unique image patches not present in other classes' images (i.e., traits). As a result, the true class's multi-head attention maps reveal traits and their locations. Implementation-wise, Prompt-CAM is almost a "free lunch," requiring only a modification to the prediction head of Visual Prompt Tuning (VPT). This makes Prompt-CAM easy to train and apply, in stark contrast to other interpretable methods that require designing specific models and training processes. Extensive empirical studies on a dozen datasets from various domains (e.g., birds, fishes, insects, fungi, flowers, food, and cars) validate the superior interpretation capability of Prompt-CAM. The source code and demo are available at https://github.com/Imageomics/Prompt_CAM. 
    (A minimal code sketch of this class-prompt idea appears after this list.)
    Free, publicly-accessible full text available June 1, 2026
  3. Free, publicly-accessible full text available December 12, 2025
  4. Free, publicly-accessible full text available December 10, 2025
  5. Free, publicly-accessible full text available December 12, 2025
  6. We investigate the quantum dynamics of a spin coupling to a bath of independent spins via the dissipaton equation of motion (DEOM) approach. The bath, characterized by a continuous spectral density function, is composed of spins that are independent two-level systems described by the su(2) Lie algebra, representing an environment with a large magnitude of anharmonicity. Building on the previous work by Suarez and Silbey [J. Chem. Phys. 95, 9115 (1991)] and by Makri [J. Chem. Phys. 111, 6164 (1999)] showing that the spin bath can be mapped to a Gaussian environment in its linear response limit, we apply a time-domain Prony fitting decomposition scheme to the bare–bath time correlation function (TCF) given by the bosonic fluctuation–dissipation theorem to generate the exponential decay basis (or pseudo modes) for the DEOM construction. The accuracy and efficiency of this strategy have been explored through a variety of numerical results. We envision that this work provides new insights into extending the hierarchical equations of motion and DEOM approaches to certain types of anharmonic environments with arbitrary TCF or spectral density.
    (A generic Prony-fit sketch appears after this list.)
  7. Free, publicly-accessible full text available April 25, 2026
  8. Free, publicly-accessible full text available December 12, 2025
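
For a concrete picture of the mechanism described in record 2, the following is a minimal PyTorch sketch of the class-prompt idea, assuming a frozen, pre-trained ViT whose transformer blocks map token sequences to token sequences. It is not the authors' implementation (see the linked repository for that); the module name PromptCAMSketch, the shared linear scorer, and the initialization choices are illustrative assumptions.

    # Conceptual sketch of class-specific prompts on a frozen ViT (not the authors' code).
    # Only the class prompts and the scoring layer are trained; the backbone stays frozen.
    import torch
    import torch.nn as nn

    class PromptCAMSketch(nn.Module):
        def __init__(self, frozen_blocks, embed_dim, num_classes):
            super().__init__()
            self.blocks = frozen_blocks                      # pre-trained ViT blocks, kept frozen
            for p in self.blocks.parameters():
                p.requires_grad_(False)
            # One learnable prompt token per class (the only new parameters besides the scorer).
            self.class_prompts = nn.Parameter(torch.zeros(num_classes, embed_dim))
            nn.init.trunc_normal_(self.class_prompts, std=0.02)
            # Shared scorer: class c's logit is computed from class c's own prompt output.
            self.score = nn.Linear(embed_dim, 1)

        def forward(self, patch_tokens):
            # patch_tokens: (B, N, D) patch embeddings from the frozen ViT stem.
            B = patch_tokens.size(0)
            prompts = self.class_prompts.unsqueeze(0).expand(B, -1, -1)   # (B, C, D)
            x = torch.cat([prompts, patch_tokens], dim=1)                 # (B, C+N, D)
            for blk in self.blocks:
                x = blk(x)                                                # frozen forward passes
            prompt_out = x[:, : self.class_prompts.size(0)]               # (B, C, D)
            return self.score(prompt_out).squeeze(-1)                     # (B, C) class logits

At visualization time, the per-head attention weights of the true-class prompt row over the patch columns (gathered, for example, with forward hooks on the attention modules) would play the role of the trait maps described in the abstract.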
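Record 6's exponential (pseudo-mode) decomposition of the bath time correlation function can be illustrated with a generic, textbook time-domain Prony fit. This is a hedged sketch rather than the paper's actual fitting scheme; the least-squares linear-prediction formulation and the two-exponential toy correlation function below are illustrative assumptions.

    # Generic time-domain Prony fit: C(t) ~ sum_k d_k * exp(-gamma_k * t) on a uniform grid.
    import numpy as np

    def prony_fit(c, dt, n_modes):
        """Fit uniformly sampled data c[j] = C(j*dt) with n_modes complex exponentials."""
        N, M = len(c), n_modes
        # 1) Linear-prediction coefficients a from a least-squares system:
        #    c[j] + a_1*c[j-1] + ... + a_M*c[j-M] = 0 for j = M..N-1.
        A = np.array([[c[M + i - m] for m in range(1, M + 1)] for i in range(N - M)])
        a = np.linalg.lstsq(A, -c[M:], rcond=None)[0]
        # 2) Poles z_k are the roots of z^M + a_1*z^(M-1) + ... + a_M; decay rates follow.
        z = np.roots(np.concatenate(([1.0], a)))
        gamma = -np.log(z) / dt
        # 3) Amplitudes d_k from a Vandermonde least-squares fit.
        V = z[None, :] ** np.arange(N)[:, None]
        d = np.linalg.lstsq(V, c, rcond=None)[0]
        return d, gamma

    # Toy check with a made-up two-exponential "correlation function".
    t = np.arange(200) * 0.05
    c = 0.8 * np.exp(-0.5 * t) + 0.2 * np.exp(-(2.0 + 1.0j) * t)
    d, gamma = prony_fit(c, dt=0.05, n_modes=2)
    print(np.round(d, 3), np.round(gamma, 3))

The recovered amplitudes d_k and rates gamma_k are the kind of exponential-decay basis (pseudo modes) that a DEOM or HEOM construction consumes.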